PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Csa20g081120.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 981aa    MW: 108528 Da    PI: 4.6419
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Csa20g081120.1genomeCSGPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox64.61.4e-20231286156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     +++ +++t+ q++e+e+lF++n++p+ ++r++L+++lgL+ rqVk+WFqNrR+++k
  Csa20g081120.1 231 KKRYHRHTNRQIQEMEALFKENPHPDDKQRKRLSAELGLKPRQVKFWFQNRRTQMK 286
                     688999***********************************************998 PP

2START146.13.4e-464416741206
                     HHHHHHHHHHHHHHHHC-TT-EEEE........EXCCTTEEEEEEESSS...........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S CS
           START   1 elaeeaaqelvkkalaeepgWvkss........esengdevlqkfeeskv..........dsgealrasgvvdmvlallveellddkeqWdetla 77 
                     e+a ++ qel k+ +aeep+W k            +n++e+++ f+              +  ea++a  vv+m++ +lv  +l+   +W+e++ 
  Csa20g081120.1 441 EIAVSCVQELTKMCDAEEPLWIKKKsdkisgeiLCLNEEEYMRLFP---WpvenhnnkadFAREASKANSVVIMNSITLVDAFLNAD-KWSEMFC 531
                     578899***************9999777776554456666666664...033556699999**************************.******9 PP

                     ....EEEEEEEECTT.....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE..-TTS--..-TTSEE-EESSEEEEEEEEC CS
           START  78 ....kaetlevissg.....galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvd..seqkppe.sssvvRaellpSgiliepks 158
                         +a+t++ issg     g l lm+ae+q+lsplvp R+ +f+Ry +q  + g w+ivd  +d  ++q++p  + s    ++ pSg++i++++
  Csa20g081120.1 532 sivaRAKTVQIISSGvsgasGSLLLMYAEFQVLSPLVPtREAYFLRYVEQnAENGNWAIVDFPIDsfHDQMQPPsTNSPHEYKRKPSGCIIQDMP 626
                     9999**********************************************99*********99883334444444666667779*********** PP

                     TCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 159 nghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     ng+s+v wvehv+++++++h+ +   vksg+a+ga++w+  lqrqce+
  Csa20g081120.1 627 NGYSQVKWVEHVEVDEKHVHETFAEYVKSGMAFGANRWLDVLQRQCER 674
                     **********************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466895.85E-20220289IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.4E-21225295IPR009057Homeodomain-like
PROSITE profilePS5007117.589228288IPR001356Homeobox domain
SMARTSM003891.4E-18229292IPR001356Homeobox domain
CDDcd000862.04E-19231289No hitNo description
PfamPF000464.4E-18231286IPR001356Homeobox domain
PROSITE patternPS000270263286IPR017970Homeobox, conserved site
PROSITE profilePS5084843.887432677IPR002913START domain
SuperFamilySSF559611.37E-30434676No hitNo description
CDDcd088756.81E-107436673No hitNo description
SMARTSM002342.6E-26441674IPR002913START domain
PfamPF018521.3E-38442674IPR002913START domain
SuperFamilySSF559616.73E-10701849No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 981 aa     Download sequence    Send to blast
MLSIGEGNVM TSDNMRFASQ QPSSSSPGTI QNPNFNFIPF NSFSSIIPKE EHGMMSMMMM  60
MGDGTVEEMM ENGSAGGSFG SGSEQAEDPK FGNESDVNEL QDDEQPPPAK KKRYHRHTNL  120
MLSIGEGNVM TSDNMRFASQ QPSSSSPGTI QNPNFNFIPF NSFSSIIPKE EHGMMSMMMM  180
MGDGTVEEMM ENGSAGGSFG SGSEQAEDPK FGNESDVNEL QDDEQPPPAK KKRYHRHTNR  240
QIQEMEALFK ENPHPDDKQR KRLSAELGLK PRQVKFWFQN RRTQMKAQQD RTENAMLRAE  300
NANLKSENCH LQGELRCLSC PSCGGPTVLG DIPFNELHIE NCRLREELDR ICCITSRYTG  360
RPMQSMPSSQ PLIDPSSTLP HHQPSLELDM SVYAGNFPEH SCADMMMLPS QDTTCFFPDQ  420
TVNNNNNNML LAEEEKVIAM EIAVSCVQEL TKMCDAEEPL WIKKKSDKIS GEILCLNEEE  480
YMRLFPWPVE NHNNKADFAR EASKANSVVI MNSITLVDAF LNADKWSEMF CSIVARAKTV  540
QIISSGVSGA SGSLLLMYAE FQVLSPLVPT REAYFLRYVE QNAENGNWAI VDFPIDSFHD  600
QMQPPSTNSP HEYKRKPSGC IIQDMPNGYS QVKWVEHVEV DEKHVHETFA EYVKSGMAFG  660
ANRWLDVLQR QCERIASLMA RNITDLGGYD IISRSEEKHD EVITEDDTVR ITTRKMCEPG  720
QPTGVLLSAV STTWLPFTHH QVFDLIRDQH HQSLLEVLFN GNSPHEVAHI ANGSHPGNCI  780
SLLRINVASN SWHNVELMLQ ESSIDNSGSL IVYSTVDVDS IQLAMNGEDS SNIPILPLGF  840
SIVPVASNSW HNVELMLQES SIDNSGSLIV YSTVDVDSIQ LAMNGEDSSN IPILPLGFSI  900
VPVNPPEGIS VNSNSPPSCL LTVAIQVLAS NVPTAKPNLS TVTTINNHLC ATVNQITSAL  960
TSSVTPAIAS SAAVSKQEAS *
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1256261DKQRKR
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAB0133940.0AB013394.1 Arabidopsis thaliana genomic DNA, chromosome 5, P1 clone:MQD22.
GenBankCP0026880.0CP002688.1 Arabidopsis thaliana chromosome 5 sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010494885.10.0PREDICTED: homeobox-leucine zipper protein HDG5-like isoform X1
RefseqXP_010494886.10.0PREDICTED: homeobox-leucine zipper protein HDG5-like isoform X2
RefseqXP_010494887.10.0PREDICTED: homeobox-leucine zipper protein HDG5-like isoform X2
RefseqXP_010494888.10.0PREDICTED: homeobox-leucine zipper protein HDG5-like isoform X2
RefseqXP_010494890.10.0PREDICTED: homeobox-leucine zipper protein HDG5-like isoform X3
RefseqXP_010494891.10.0PREDICTED: homeobox-leucine zipper protein HDG5-like isoform X4
SwissprotQ9FJS20.0HDG5_ARATH; Homeobox-leucine zipper protein HDG5
TrEMBLR0GUB60.0R0GUB6_9BRAS; Uncharacterized protein
STRINGAT5G46880.10.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM43562548
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G46880.10.0homeobox-7